Multi-Level Weighted Sequential Pattern Mining Based on Prime Encoding

نویسندگان

  • Yun Li
  • Lianglei Sun
  • Jiang Yin
  • Wenyan Bao
  • Mengyuan Gu
چکیده

Encoding can express the hierarchical relationship in the area of mining the multi-level sequential pattern, up to now all the algorithms of which find frequent sequences just according to frequency, but items have different importance in the real applications, therefore the weight constraint involved to the entire mining process is crucial. The MWSP algorithm based on the candidate generation-and-test approach is one of the best algorithms of the weighted sequential pattern mining, however, which will easily generate the situation of candidate combinatorial explosion during the mining process. Therefore, this paper presents the algorithm PMWSM, which adopts prime encoding to decide the parent-child relationship between different levels by one step of division operation, introduces the concept of K-minimum weighted support count to push weight constraint into the multi-level sequential pattern mining, utilizes the principle of prefix projection database to avoid the occurrence of candidate combinatorial explosion, and takes full advantage of the minimum weighted support count to optimize the algorithm. The experimental results show that the algorithm PMWSM is more effective than the algorithm MWSP on mining multi-level weighted sequential patterns from the sequence database.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Sequential Pattern Mining by Pattern-Growth: Principles and Extensions

Sequential pattern mining is an important data mining problem with broad applications. However, it is also a challenging problem since the mining may have to generate or examine a combinatorially explosive number of intermediate subsequences. Recent studies have developed two major classes of sequential pattern mining methods: (1) a candidate generation-and-test approach, represented by (i) GSP...

متن کامل

Prism: An effective approach for frequent sequence mining via prime-block encoding

Article history: Received 30 September 2007 Received in revised form 4 June 2008 Available online 23 May 2009

متن کامل

A Novel Weighted Support Method for Access Pattern Mining

Sequential Pattern Mining is an important data mining technique that finds out all frequent sequential patterns in a sequence database. Applications in wide range of important domains make Sequential Pattern Mining an interesting area of research. Conventional approach for sequential pattern mining treats each and every item in the sequence with equal importance and thus fails to reflect the in...

متن کامل

Performance Analysis of web page recommendation algorithm based on weighted sequential patterns and markov model

Web usage mining techniques helps the users to predict the required Web page recommendations. In recent times, there has been a considerable significance given to sequential mining approaches to construct web page recommendation systems. This paper focuses on developing a web page recommendation approach for accessing related web pages more efficiently and effectively using weighted sequential ...

متن کامل

Mining rare sequential patterns with ASP

This article presents an approach of meaningful rare sequential pattern mining based on the declarative programming paradigm of Answer Set Programming (ASP). The setting of rare sequential pattern mining is introduced. Our ASP approach provides an easy manner to encode expert constraints on expected patterns to cope with the huge amount of meaningless rare patterns. Encodings are presented and ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • JDCTA

دوره 4  شماره 

صفحات  -

تاریخ انتشار 2010